SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Hein, Mich B.
Hiatt, Andrew C.
Ma, Julian K.C.

(ii) TITLE OF INVENTION: TRANSGENIC PLANTS EXPRESSING ASSEMBLED SECRETORY ANTIBODIES

(iii) NUMBER OF SEQUENCES: 26

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: THE SCRIPPS RESEARCH INSTITUTE
(B) STREET: 10666 North Torrey Pines Road, TPC-8
(C) CITY: La Jolla
(D) STATE: California
(E) COUNTRY: US
(F) ZIP: 92037

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE: 03-MAY-1996
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/591,823
(B) FILING DATE: 02-OCT-1990

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/427,765
(B) FILING DATE: 27-OCT-1989

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Logan, April C.
(B) REGISTRATION NUMBER: 33,950
(C) REFERENCE/DOCKET NUMBER: 184.2

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (619) 554-2937
(B) TELEFAX: (619) 554-6312



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCTTGACCGT AAGACATG

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

AATTCATGTC TTACGGTCAA GG

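As a consistency check on the two oligonucleotides above, the following minimal Python sketch confirms the declared lengths (18 and 22 bases) and shows that, after a four-base AATT prefix, SEQ ID NO:2 reads as the reverse complement of SEQ ID NO:1. The helper names `clean` and `reverse_complement` are illustrative assumptions and are not part of the listing; the listing itself does not state how the two oligonucleotides are used.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq))


def clean(listing_line: str) -> str:
    """Drop the 10-base grouping spaces used in the listing."""
    return listing_line.replace(" ", "").upper()


# Sequences copied from SEQ ID NO:1 and SEQ ID NO:2 of the listing.
seq_id_1 = clean("CCTTGACCGT AAGACATG")
seq_id_2 = clean("AATTCATGTC TTACGGTCAA GG")

# Declared lengths in the listing are 18 and 22 bases.
lengths_ok = (len(seq_id_1), len(seq_id_2)) == (18, 22)

# Split SEQ ID NO:2 into its first four bases and the remainder.
prefix, remainder = seq_id_2[:4], seq_id_2[4:]
is_complementary = remainder == reverse_complement(seq_id_1)

print(lengths_ok, prefix, is_complementary)
```

The same two helpers can be reused to compare any other pair of oligonucleotides in the listing.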
(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

TGTGAAAACC ATATTGAATT CCACCAATAC AAA

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

ATTTAGCACA ACATCCATGT CGACGAATTC AATCCAAAAA AGCAT 45

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

GGGGAGCTGG TGGTGGAATT CGTCGACCTT TGTCTCTAAC AC 42

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

CCATCCCATG GTTGAATTCA GTGTCGTCAG

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

CTGCAACTGG ACCTGCATGT CGACGAATTC AGCTCCTGAC AGGAG

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

CCTGTAGGAC CAGAGGAATT CGTCGACACT GGGATTATTT AC

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

GAATTCATTC AAGAATAGTT CAAACAAGAA GATTACAAAC TATCAATTTC ATACACAATA 60
TAAACGATTA AAAGA 75

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30

Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80

Ser Leu Asp Leu Lys Arg Asp Val Val Leu
85 90



(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30

Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80

Ser Leu Asp Leu Lys Arg Glu Val Glu Leu
85 90



(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Met Glu Leu Asp Leu Ser Leu Pro Leu Ser Gly Ala Ala Gly Gly Thr
1 5 10 15



(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

ACCAGATCTA TGGAATGGAC CTGGGTTTTT C

(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

CCCAAGCTTG GTTTTGGAGA TGGTTTTCTC

(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

GATAAGCTTG GTCCTACTCC TCCTCCTCCT A

(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

AATCTCGAGT CAGTAGCAGA TGCCATCTCC

(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

GGAAAGCTTT GTACATATGC AAGGCTTACA

(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

GATCTATGGC TCTCTTCTTG CTC

(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

AATTCTTATT CCGCACTCTG CACTGC 26

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3517 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

GGCCGGGGTT ACGGGCTGGC CAGCAGGCTG TGCCCCCGAG TCCGGTCAGC AGGAGGGGAA 60
GAAGTGGCCT AAAATCTCTC CCGCATCGGC AGCCCAGGCC TAGTGCCCTA CCAGCCACCA 120
GCCATGGCTC TCTTCTTGCT CACCTGCCTG CTGGCTGTCT TTTCAGCGGC CACGGCACAA 180
AGCTCCTTAT TGGGTCCCAG CTCCATATTT GGTCCCGGGG AGGTGAATGT TTTGGAAGGC 240
GACTCGGTGT CCATCACATG CTACTACCCA ACAACCTCCG TCACCCGGCA CAGCCGGAAG 300
TTCTGGTGCC GGGAAGAGGA GAGCGGCCGC TGCGTGACGC TTGCCTCGAC CGGCTACACG 360
TCCCAGGAAT ACTCCGGGAG AGGCAAGCTC ACCGACTTCC CTGATAAAGG GGAGTTTGTG 420
GTGACTGTTG ACCAACTCAC CCAGAACGAC TCAGGGAGCT ACAAGTGTGG CGTGGGAGTC 480
AACGGCCGTG GCCTGGACTT CGGTGTCAAC GTGCTGGTCA GCCAGAAGCC AGAGCCTGAT 540
GACGTTGTTT ACAAACAATA TGAGAGTTAT ACAGTAACCA TCACCTGCCC TTTCACATAT 600
GCGACTAGGC AACTAAAGAA GTCCTTTTAC AAGGTGGAAG ACGGGGAACT TGTACTCATC 660
ATTGATTCCA GCAGTAAGGA GGCAAAGGAC CCCAGGTATA AGGGCAGAAT AACGTTGCAG 720
ATCCAAAGTA CCACAGCAAA AGAATTCACA GTCACCATCA AGCATTTGCA GCTCAATGAT 780
GCTGGGCAGT ATGTCTGCCA GAGTGGAAGC GACCCCACTG CTGAAGAACA GAACGTTGAC 840
CTCCGACTGC TAACTCCTGG TCTGCTCTAT GGAAACCTGG GGGGCTCGGT GACCTTTGAA 900
TGTGCCCTGG ACTCTGAAGA CGCAAACGCG GTAGCATCCT TGCGCCAGGT TAGGGGTGGC 960
AATGTGGTCA TTGACAGCCA GGGGACAATA GATCCAGCCT TCGAGGGCAG GATCCTGTTC 1020
ACCAAGGCTG AGAACGGCCA CTTCAGTGTA GTGATCGCAG GCCTGAGGAA GGAAGACACA 1080
GGGAACTATC TGTGCGGAGT CCAGTCCAAT GGTCAGTCTG GGGATGGGCC CACCCAGCTT 1140
CGGCAACTCT TCGTCAATGA AGAGATCGAC GTGTCCCGCA GCCGCCCTGT GTTGAAGGGC 1200
TTTCCAGGAG GCTCCGTGAC CATACGCTGC CCCTACAACC CGAAGAGAAG CGACAGCCAC 1260
CTGCAGCTGT ATCTCTGGGA AGGGAGTCAA ACCCGCCATC TGCTGGTGGA CAGCGGCGAG 1320
GGGCTGGTTC AGAAAGACTA CACAGGCAGG CTGGCCCTGT TCGAAGAGCC TGGCAATGGC 1380
ACCTTCTCAG TCGTCCTCAA CCAGCTCACT GCCGAGGATG AAGGCTTCTA CTGGTGTGTC 1440
AGCGATGACG ATGAGTCCCT GACGACTTCG GTGAAGCTCC AGATCGTTGA CGGAGAACCA 1500
AGCCCCACGA TCGACAAGTT CACTGCTGTG CAGGGAGAGC CTGTTGAGAT CAGCTGCCAC 1560
TTCCCATGCA AATACTTCTC CTCCGAGAAG TACTGGTGCA AGTGGAATGA CCATGGCTGC 1620
GAGGACCTGC CCACTAAGCT CAGCTCCAGC GGCGACCTTG TGAAATGCAA CAACAACCTG 1680
GTCCTCACCC TGACCTTGGA CTCGGTCAGC GAAGATGACG AGGGCTGGTA CTGGTGTGGC 1740
GCGAAAGACG GGCACGAGTT TGAAGAGGTT GCGGCCGTCA GGGTGGAGCT GACAGAGCCA 1800
GCCAAGGTAG CTGTCGAGCC AGCCAAGGTA CCTGTCGACC CAGCCAAGGC AGCCCCCGCG 1860
CCTGCTGAGG AGAAGGCCAA GGCGCGGTGC CCAGTGCCCA GGAGAAGGCA GTGGTACCCA 1920
TTGTCAAGGA AGCTGAGAAC AAGTTGTCCA GAACCTCGGC TCCTTGCGGA GGAGGTAGCA 1980
GTGCAGAGTG CGGAAGACCC AGCCAGTGGG AGCAGAGCGT CTGTGGATGC CAGCAGTGCT 2040
TCGGGACAAA GCGGGAGTGC CAAAGTACTG ATCTCCACCC TGGTGCCCTT GGGGCTGGTG 2100
CTGGCAGCGG GGGCCATGGC CGTGGCCATA GCCAGAGCCC GGCACAGGAG GAACGTGGAC 2160
CGAGTTTCCA TCGGAAGCTA CAGGACAGAC ATTAGCATGT CAGACTTGGA GAACTCCAGG 2220
GAGTTCGGAG CCATTGACAA CCCAAGCGCC TGCCCCGATG CCCGGGAGAC GGCCCTCGGA 2280
GGAAAGGATG AGTTAGCGAC GGCCACCGAG AGCACCGTGG AGATTGAGGA GCCCAAGAAG 2340
GCAAAACGGT CATCCAAGGA AGAAGCCGAC CTGGCCTACT CAGCTTTCCT GCTCCAATCC 2400
AACACCATAG CTGCTGAGCA CCAAGATGGC CCCAAGGAGG CCTAGGCACA GCCGGCCACC 2460
GCCGCCGCCG CCACCGCCGC CGCCGCCGCC ACCTGTGAAA ATCACCTTCC AGAATCACGT 2520
TGATCCTCGG GGTCCCCAGA GCCGGGGGCT CAACCGCCCT GCACCCCCCA TGTCCCCACC 2580
ACCTAAACTT CCCTACCTGT GCCCAGAGGT GTGCTGGTCC CCTCCTCCAC GGCATCCAGG 2640
CCTGGCTCAA TGTTCCCGTT GGGGTGGGGG TGTGAGGGGT TCCTACTTGC AGCCCGGTTC 2700
TCCCGAGAGA AGCTAAGGAT CCAGGTCCTG AGGGAGGGGC CTCTCGAAGG CAGACAGACC 2760
AGAGAGGGGG GAGGAGCCCT TGGATGGGAG GCCAGAGGCG CTTTCCGGCC ACCCCCTCCC 2820
TCCCTGCCCC CACCCTCCTT CCTTCATTCA AAAGTCCCAG TGGCTGCTGC CTAGGGTCCA 2880
GGCGCTGGCC GCACGCCTCC TCGAAGCCGT TGTGCAAACA TCACTGGAGG AAGCCAGGGC 2940
TCCTCCCGGG CTGTGTATCC TCACTCAGGC ATCCTGTCCT CCCCAGTATC AGGAGATGTC 3000
AAGCwTCTGA AGGCTGTGTG CCCTGGGCGT GTCTGCAAGT CACCCCAGAC ACATGTTCTC 3060
GCCATTTTAC AGATGAGAAC ACTGAGGTTG TACTCAAGGG CACCCTGCGA GATGGAGCAA 3120
CAGCAAACTA GATGGGCTTC TGCTGTCCTC TTGGCCAGAG GTCTCTCCAC AGGAGCCCCT 3180
GCCCCTGTAG GAAGCAGAGT TTTAGAACAT GGAAGAAGAA GAGGGGGATG GCCCTGGACG 3240
CTGACCTCTC CCAAGCCCCC ACGGGGGAAA AGGCCCCCTC CTTTTCTGTC ACTCTCGGGG 3300
ACCTGCGGAG TTGAGCATTC GTGCCCCGTG TGTCTGAAGA GTTCCCAGTG GAAAGAAGAA 3360
AAGAGGGTGT TTGTCAGTGC CGGGGAGGGC CTGATCCCCA GACAGCTGAA GTTTAAGGTC 3420
CTTGTCCCTG TGAGCTTTAA CCAGCACCTC CGGGCTGACC CTTGCTAACA CATCAGAAAT 3480
GTGATTTAAT CATTAAACAT TGTGATTGCC ACTGGGA 3517

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1875 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1875

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

ATG GCT CTC TTC TTG CTC ACC TGC CTG CTG GCT GTC TTT TCA GCG GCC 48
Met Ala Leu Phe Leu Leu Thr Cys Leu Leu Ala Val Phe Ser Ala Ala
1 5 10 15

ACG GCA CAA AGC TCC TTA TTG GGT CCC AGC TCC ATA TTT GGT CCC GGG 96
Thr Ala Gln Ser Ser Leu Leu Gly Pro Ser Ser Ile Phe Gly Pro Gly
20 25 30

GAG GTG AAT GTT TTG GAA GGC GAC TCG GTG TCC ATC ACA TGC TAC TAC 144
Glu Val Asn Val Leu Glu Gly Asp Ser Val Ser Ile Thr Cys Tyr Tyr
35 40 45

CCA ACA ACC TCC GTC ACC CGG CAC AGC CGG AAG TTC TGG TGC CGG GAA 192
Pro Thr Thr Ser Val Thr Arg His Ser Arg Lys Phe Trp Cys Arg Glu
50 55 60

GAG GAG AGC GGC CGC TGC GTG ACG CTT GCC TCG ACC GGC TAC ACG TCC 240
Glu Glu Ser Gly Arg Cys Val Thr Leu Ala Ser Thr Gly Tyr Thr Ser
65 70 75 80

CAG GAA TAC TCC GGG AGA GGC AAG CTC ACC GAC TTC CCT GAT AAA GGG 288
Gln Glu Tyr Ser Gly Arg Gly Lys Leu Thr Asp Phe Pro Asp Lys Gly
85 90 95

GAG TTT GTG GTG ACT GTT GAC CAA CTC ACC CAG AAC GAC TCA GGG AGC 336
Glu Phe Val Val Thr Val Asp Gln Leu Thr Gln Asn Asp Ser Gly Ser
100 105 110

TAC AAG TGT GGC GTG GGA GTC AAC GGC CGT GGC CTG GAC TTC GGT GTC 384
Tyr Lys Cys Gly Val Gly Val Asn Gly Arg Gly Leu Asp Phe Gly Val
115 120 125

AAC GTG CTG GTC AGC CAG AAG CCA GAG CCT GAT GAC GTT GTT TAC AAA 432
Asn Val Leu Val Ser Gln Lys Pro Glu Pro Asp Asp Val Val Tyr Lys
130 135 140

CAA TAT GAG AGT TAT ACA GTA ACC ATC ACC TGC CCT TTC ACA TAT GCG 480
Gln Tyr Glu Ser Tyr Thr Val Thr Ile Thr Cys Pro Phe Thr Tyr Ala
145 150 155 160

ACT AGG CAA CTA AAG AAG TCC TTT TAC AAG GTG GAA GAC GGG GAA CTT 528
Thr Arg Gln Leu Lys Lys Ser Phe Tyr Lys Val Glu Asp Gly Glu Leu
165 170 175

GTA CTC ATC ATT GAT TCC AGC AGT AAG GAG GCA AAG GAC CCC AGG TAT 576
Val Leu Ile Ile Asp Ser Ser Ser Lys Glu Ala Lys Asp Pro Arg Tyr
180 185 190

AAG GGC AGA ATA ACG TTG CAG ATC CAA AGT ACC ACA GCA AAA GAA TTC 624
Lys Gly Arg Ile Thr Leu Gln Ile Gln Ser Thr Thr Ala Lys Glu Phe
195 200 205

ACA GTC ACC ATC AAG CAT TTG CAG CTC AAT GAT GCT GGG CAG TAT GTC 672
Thr Val Thr Ile Lys His Leu Gln Leu Asn Asp Ala Gly Gln Tyr Val
210 215 220

TGC CAG AGT GGA AGC GAC CCC ACT GCT GAA GAA CAG AAC GTT GAC CTC 720
Cys Gln Ser Gly Ser Asp Pro Thr Ala Glu Glu Gln Asn Val Asp Leu
225 230 235 240

CGA CTG CTA ACT CCT GGT CTG CTC TAT GGA AAC CTG GGG GGC TCG GTG 768
Arg Leu Leu Thr Pro Gly Leu Leu Tyr Gly Asn Leu Gly Gly Ser Val
245 250 255

ACC TTT GAA TGT GCC CTG GAC TCT GAA GAC GCA AAC GCG GTA GCA TCC 816
Thr Phe Glu Cys Ala Leu Asp Ser Glu Asp Ala Asn Ala Val Ala Ser
260 265 270

TTG CGC CAG GTT AGG GGT GGC AAT GTG GTC ATT GAC AGC CAG GGG ACA 864
Leu Arg Gln Val Arg Gly Gly Asn Val Val Ile Asp Ser Gln Gly Thr
275 280 285

ATA GAT CCA GCC TTC GAG GGC AGG ATC CTG TTC ACC AAG GCT GAG AAC 912
Ile Asp Pro Ala Phe Glu Gly Arg Ile Leu Phe Thr Lys Ala Glu Asn
290 295 300

GGC CAC TTC AGT GTA GTG ATC GCA GGC CTG AGG AAG GAA GAC ACA GGG 960
Gly His Phe Ser Val Val Ile Ala Gly Leu Arg Lys Glu Asp Thr Gly
305 310 315 320

AAC TAT CTG TGC GGA GTC CAG TCC AAT GGT CAG TCT GGG GAT GGG CCC 1008
Asn Tyr Leu Cys Gly Val Gln Ser Asn Gly Gln Ser Gly Asp Gly Pro
325 330 335

ACC CAG CTT CGG CAA CTC TTC GTC AAT GAA GAG ATC GAC GTG TCC CGC 1056
Thr Gln Leu Arg Gln Leu Phe Val Asn Glu Glu Ile Asp Val Ser Arg
340 345 350

AGC CCC CCT GTG TTG AAG GGC TTT CCA GGA GGC TCC GTG ACC ATA CGC 1104
Ser Pro Pro Val Leu Lys Gly Phe Pro Gly Gly Ser Val Thr Ile Arg
355 360 365

TGC CCC TAC AAC CCG AAG AGA AGC GAC AGC CAC CTG CAG CTG TAT CTC 1152
Cys Pro Tyr Asn Pro Lys Arg Ser Asp Ser His Leu Gln Leu Tyr Leu
370 375 380

TGG GAA GGG AGT CAA ACC CGC CAT CTG CTG GTG GAC AGC GGC GAG GGG 1200
Trp Glu Gly Ser Gln Thr Arg His Leu Leu Val Asp Ser Gly Glu Gly
385 390 395 400

CTG GTT CAG AAA GAC TAC ACA GGC AGG CTG GCC CTG TTC GAA GAG CCT 1248
Leu Val Gln Lys Asp Tyr Thr Gly Arg Leu Ala Leu Phe Glu Glu Pro
405 410 415

GGC AAT GGC ACC TTC TCA GTC GTC CTC AAC CAG CTC ACT GCC GAG GAT 1296
Gly Asn Gly Thr Phe Ser Val Val Leu Asn Gln Leu Thr Ala Glu Asp
420 425 430

GAA GGC TTC TAC TGG TGT GTC AGC GAT GAC GAT GAG TCC CTG ACG ACT 1344
Glu Gly Phe Tyr Trp Cys Val Ser Asp Asp Asp Glu Ser Leu Thr Thr
435 440 445

TCT GTG AAG CTC CAG ATC GTT GAC GGA GAA CCA AGC CCC ACG ATC GAC 1392
Ser Val Lys Leu Gln Ile Val Asp Gly Glu Pro Ser Pro Thr Ile Asp
450 455 460

AAG TTC ACT GCT GTG CAG GGA GAG CCT GTT GAG ATC ACC TGC CAC TTC 1440
Lys Phe Thr Ala Val Gln Gly Glu Pro Val Glu Ile Thr Cys His Phe
465 470 475 480

CCA TGC AAA TAC TTC TCC TCC GAG AAG TAC TGG TGC AAG TGG AAT GAC 1488
Pro Cys Lys Tyr Phe Ser Ser Glu Lys Tyr Trp Cys Lys Trp Asn Asp
485 490 495

CAT GGC TGC GAG GAC CTG CCC ACT AAG CTC AGC TCC AGC GGC GAC CTT 1536
His Gly Cys Glu Asp Leu Pro Thr Lys Leu Ser Ser Ser Gly Asp Leu
500 505 510

GTG AAA TGC AAC AAC AAC CTG GTC CTC ACC CTG ACC TTG GAC TCG GTC 1584
Val Lys Cys Asn Asn Asn Leu Val Leu Thr Leu Thr Leu Asp Ser Val
515 520 525

AGC GAA GAT GAC GAG GGC TGG TAC TGG TGT GGC GCG AAA GAC GGG CAC 1632
Ser Glu Asp Asp Glu Gly Trp Tyr Trp Cys Gly Ala Lys Asp Gly His
530 535 540

GAG TTT GAA GAG GTT GCG GCC GTC AGG GTG GAG CTG ACA GAG CCA GCC 1680
Glu Phe Glu Glu Val Ala Ala Val Arg Val Glu Leu Thr Glu Pro Ala
545 550 555 560

AAG GTA GCT GTC GAG CCA GCC AAG GTA CCT GTC GAC CCA GCC AAG GCA 1728
Lys Val Ala Val Glu Pro Ala Lys Val Pro Val Asp Pro Ala Lys Ala
565 570 575

GCC CCC GCA CCT GCT GAG GAG AAG GCC AAG GCG CGG TGC CCA GTG CCC 1776
Ala Pro Ala Pro Ala Glu Glu Lys Ala Lys Ala Arg Cys Pro Val Pro
580 585 590

AGG AGA AGG CAG TGG TAC CCA TTG TCA AGG AAG CTG AGA ACA AGT TGT 1824
Arg Arg Arg Gln Trp Tyr Pro Leu Ser Arg Lys Leu Arg Thr Ser Cys
595 600 605

CCA GAA CCT CGG CTC CTT GCG GAG GAG GTA GCA GTG CAG AGT GCG GAA 1872
Pro Glu Pro Arg Leu Leu Ala Glu Glu Val Ala Val Gln Ser Ala Glu
610 615 620

TAA 1875
625
(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 624 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

Met Ala Leu Phe Leu Leu Thr Cys Leu Leu Ala Val Phe Ser Ala Ala
1 5 10 15

Thr Ala Gln Ser Ser Leu Leu Gly Pro Ser Ser Ile Phe Gly Pro Gly
20 25 30

Glu Val Asn Val Leu Glu Gly Asp Ser Val Ser Ile Thr Cys Tyr Tyr
35 40 45

Pro Thr Thr Ser Val Thr Arg His Ser Arg Lys Phe Trp Cys Arg Glu
50 55 60

Glu Glu Ser Gly Arg Cys Val Thr Leu Ala Ser Thr Gly Tyr Thr Ser
65 70 75 80

Gln Glu Tyr Ser Gly Arg Gly Lys Leu Thr Asp Phe Pro Asp Lys Gly
85 90 95

Glu Phe Val Val Thr Val Asp Gln Leu Thr Gln Asn Asp Ser Gly Ser
100 105 110

Tyr Lys Cys Gly Val Gly Val Asn Gly Arg Gly Leu Asp Phe Gly Val
115 120 125

Asn Val Leu Val Ser Gln Lys Pro Glu Pro Asp Asp Val Val Tyr Lys
130 135 140

Gln Tyr Glu Ser Tyr Thr Val Thr Ile Thr Cys Pro Phe Thr Tyr Ala
145 150 155 160

Thr Arg Gln Leu Lys Lys Ser Phe Tyr Lys Val Glu Asp Gly Glu Leu
165 170 175

Val Leu Ile Ile Asp Ser Ser Ser Lys Glu Ala Lys Asp Pro Arg Tyr
180 185 190

Lys Gly Arg Ile Thr Leu Gln Ile Gln Ser Thr Thr Ala Lys Glu Phe
195 200 205

Thr Val Thr Ile Lys His Leu Gln Leu Asn Asp Ala Gly Gln Tyr Val
210 215 220

Cys Gln Ser Gly Ser Asp Pro Thr Ala Glu Glu Gln Asn Val Asp Leu
225 230 235 240

Arg Leu Leu Thr Pro Gly Leu Leu Tyr Gly Asn Leu Gly Gly Ser Val
245 250 255

Thr Phe Glu Cys Ala Leu Asp Ser Glu Asp Ala Asn Ala Val Ala Ser
260 265 270

Leu Arg Gln Val Arg Gly Gly Asn Val Val Ile Asp Ser Gln Gly Thr
275 280 285

Ile Asp Pro Ala Phe Glu Gly Arg Ile Leu Phe Thr Lys Ala Glu Asn
290 295 300

Gly His Phe Ser Val Val Ile Ala Gly Leu Arg Lys Glu Asp Thr Gly
305 310 315 320

Asn Tyr Leu Cys Gly Val Gln Ser Asn Gly Gln Ser Gly Asp Gly Pro
325 330 335

Thr Gln Leu Arg Gln Leu Phe Val Asn Glu Glu Ile Asp Val Ser Arg
340 345 350

Ser Pro Pro Val Leu Lys Gly Phe Pro Gly Gly Ser Val Thr Ile Arg
355 360 365

Cys Pro Tyr Asn Pro Lys Arg Ser Asp Ser His Leu Gln Leu Tyr Leu
370 375 380

Trp Glu Gly Ser Gln Thr Arg His Leu Leu Val Asp Ser Gly Glu Gly
385 390 395 400

Leu Val Gln Lys Asp Tyr Thr Gly Arg Leu Ala Leu Phe Glu Glu Pro
405 410 415

Gly Asn Gly Thr Phe Ser Val Val Leu Asn Gln Leu Thr Ala Glu Asp
420 425 430

Glu Gly Phe Tyr Trp Cys Val Ser Asp Asp Asp Glu Ser Leu Thr Thr
435 440 445

Ser Val Lys Leu Gln Ile Val Asp Gly Glu Pro Ser Pro Thr Ile Asp
450 455 460

Lys Phe Thr Ala Val Gln Gly Glu Pro Val Glu Ile Thr Cys His Phe
465 470 475 480

Pro Cys Lys Tyr Phe Ser Ser Glu Lys Tyr Trp Cys Lys Trp Asn Asp
485 490 495

His Gly Cys Glu Asp Leu Pro Thr Lys Leu Ser Ser Ser Gly Asp Leu
500 505 510

Val Lys Cys Asn Asn Asn Leu Val Leu Thr Leu Thr Leu Asp Ser Val
515 520 525

Ser Glu Asp Asp Glu Gly Trp Tyr Trp Cys Gly Ala Lys Asp Gly His
530 535 540

Glu Phe Glu Glu Val Ala Ala Val Arg Val Glu Leu Thr Glu Pro Ala
545 550 555 560

Lys Val Ala Val Glu Pro Ala Lys Val Pro Val Asp Pro Ala Lys Ala
565 570 575

Ala Pro Ala Pro Ala Glu Glu Lys Ala Lys Ala Arg Cys Pro Val Pro
580 585 590

Arg Arg Arg Gln Trp Tyr Pro Leu Ser Arg Lys Leu Arg Thr Ser Cys
595 600 605

Pro Glu Pro Arg Leu Leu Ala Glu Glu Val Ala Val Gln Ser Ala Glu
610 615 620



(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

GATCTATGAA GACCCACCTG CTT

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

AATTCTTAGA CAGGGTAGCA AGA

(2) INFORMATION FOR SEQ ID NO:25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..480

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

ATG AAG ACC CAC CTG CTT CTC TGG GGA GTC CTC GCC ATT TTT GTT AAG 48
Met Lys Thr His Leu Leu Leu Trp Gly Val Leu Ala Ile Phe Val Lys
1 5 10 15

GTT GTC CTT GTA ACA GGT GAC GAC GAA GCG ACC ATT CTT GCT GAC AAC 96
Val Val Leu Val Thr Gly Asp Asp Glu Ala Thr Ile Leu Ala Asp Asn
20 25 30

AAA TGC ATG TGT ACC CGA GTT ACC TCT AAA ATC ATC CCT TCC ACC GAG 144
Lys Cys Met Cys Thr Arg Val Thr Ser Lys Ile Ile Pro Ser Thr Glu
35 40 45

GAT CCT AAT GAG GAC ATT GTG GAG AGA AAT ATC CGA ATT GTT GTC CCT 192
Asp Pro Asn Glu Asp Ile Val Glu Arg Asn Ile Arg Ile Val Val Pro
50 55 60

TTG AAC AAC AGG GAG AAT ATC TCT GAT CCC ACC TCC CCA CTG AGA AGG 240
Leu Asn Asn Arg Glu Asn Ile Ser Asp Pro Thr Ser Pro Leu Arg Arg
65 70 75 80

AAC TTT GTA TAC CAT TTG TCA GAC GTC TGT AAG AAA TGC GAT CCT GTG 288
Asn Phe Val Tyr His Leu Ser Asp Val Cys Lys Lys Cys Asp Pro Val
85 90 95

GAA GTG GAG CTG GAA GAT CAG GTT GTT ACT GCC ACC CAG AGC AAC ATC 336
Glu Val Glu Leu Glu Asp Gln Val Val Thr Ala Thr Gln Ser Asn Ile
100 105 110

TGC AAT GAA GAC GAT GGT GTT CCT GAG ACC TGC TAC ATG TAT GAC AGA 384
Cys Asn Glu Asp Asp Gly Val Pro Glu Thr Cys Tyr Met Tyr Asp Arg
115 120 125

AAC AAG TGC TAT ACC ACT ATG GTC CCA CTT AGG TAT CAT GGT GAG ACC 432
Asn Lys Cys Tyr Thr Thr Met Val Pro Leu Arg Tyr His Gly Glu Thr
130 135 140

AAA ATG GTG CAA GCA GCC TTG ACC CCC GAT TCT TGC TAC CCT GAC TAA 480
Lys Met Val Gln Ala Ala Leu Thr Pro Asp Ser Cys Tyr Pro Asp
145 150 155 160
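
The CDS feature of SEQ ID NO:25 spans positions 1..480, that is 160 codons, which is consistent with the 159 amino acids declared for SEQ ID NO:26 plus one stop codon. A minimal Python sketch of that bookkeeping follows, assuming the standard genetic code; the function name `translate` and the one-letter output are illustrative choices and are not part of the listing.

```python
from itertools import product

# Standard genetic code, built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes."""
    bases = "".join(cds.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore any trailing partial codon
    codons = (bases[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First sixteen codons of SEQ ID NO:25, copied from the listing.
first_row = "ATG AAG ACC CAC CTG CTT CTC TGG GGA GTC CTC GCC ATT TTT GTT AAG"
first_row_protein = translate(first_row)

# A CDS at 1..480 holds 160 codons: 159 residues plus one stop codon.
cds_length = 480
codon_count, leftover = divmod(cds_length, 3)
expected_residues = codon_count - 1

print(first_row_protein, codon_count, leftover, expected_residues)
```

The translated row, MKTHLLLWGVLAIFVK, corresponds to the first sixteen residues given for SEQ ID NO:26 (Met Lys Thr His Leu Leu Leu Trp Gly Val Leu Ala Ile Phe Val Lys).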






(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 159 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

Met Lys Thr His Leu Leu Leu Trp Gly Val Leu Ala Ile Phe Val Lys
1 5 10 15

Val Val Leu Val Thr Gly Asp Asp Glu Ala Thr Ile Leu Ala Asp Asn
20 25 30

Lys Cys Met Cys Thr Arg Val Thr Ser Lys Ile Ile Pro Ser Thr Glu
35 40 45

Asp Pro Asn Glu Asp Ile Val Glu Arg Asn Ile Arg Ile Val Val Pro
50 55 60

Leu Asn Asn Arg Glu Asn Ile Ser Asp Pro Thr Ser Pro Leu Arg Arg
65 70 75 80

Asn Phe Val Tyr His Leu Ser Asp Val Cys Lys Lys Cys Asp Pro Val
85 90 95

Glu Val Glu Leu Glu Asp Gln Val Val Thr Ala Thr Gln Ser Asn Ile
100 105 110

Cys Asn Glu Asp Asp Gly Val Pro Glu Thr Cys Tyr Met Tyr Asp Arg
115 120 125

Asn Lys Cys Tyr Thr Thr Met Val Pro Leu Arg Tyr His Gly Glu Thr
130 135 140

Lys Met Val Gln Ala Ala Leu Thr Pro Asp Ser Cys Tyr Pro Asp
145 150 155